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A suite of reduced-dimensionality 13 C, 15 N, ^-triple-resonance NMR 
experiments is presented for rapid and complete protein resonance assignment 
Even when using short measurement times, these experiments allow one to 
retain the high spectral resolution required for efficient automated analysis. 
"Sampling limited" and "sensitivity limited" data collection regimes are defined, L 
respectively, depending on whether the sampling of the indirect dimensions or the sensitivity of a 
multidimensional NMR experiments per se determines the minimally required measurement time. We 
show that reduced-dimensionality NMR spectroscopy is a powerful approach to avoid the "sampling 
limited regime"— i.e., a standard set of ten experiments proposed here allows one to effectively adapt 
minimal measurement times to sensitivity requirements. This is of particular interest in view of the greatly 
increased sensitivity of NMR spectrometers equipped with cryogenic probes. As a step toward fully 
automated analysis, the program AUTOASSIGN has been extended to provide sequential backbone and 13 C 
* resonance assignments from these reduced-dimensionality NMR data. 
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Rapid resonance assignment is a prerequisite for high-throughput (HTP) j 
structure determination and structural genomics (I). The aims of structural j 
genomics are to (/) explore the naturally occurring "protein fold space' 1 and (//) | 
contribute to the characterization of function through the assignment of jj 
atomic-resolution three-dimensional (3D) structures to proteins. The ultimate ll 
goal is to provide one or more representative 3D structures for every structural domain family in nature. 
It is now generally acknowledged that NMR will play an important role in this endeavor (1). The resulting 
demand for HTP structure determination requires fast and automated NMR data collection and analysis 
protocols. This impetus for the development of new methods will have broad impact in the technological 
infrastructure for structural biology and molecular biophysics. 

Two key objectives for NMR data collection can be identified. Firstly, the measurement time should be 
minimized so as to lower the cost per structure and relax the constraint that NMR samples need to be 
stable over long time periods. Secondly, automated analysis requires recording of a redundant set of 
NMR spectra each affording good resolution, while it is also desirable to keep the total number of spectra 
small to reduce complications due to interspectral variations of chemical shifts (2). This second objective 
can be addressed by maximizing the dimensionality of the spectra. However, the joint realization of the 
first and second objective is impeded by the large lower bounds for measurement times of four (or higher) 
dimensional NMR spectra arising from the independent sampling of three (or more) indirect dimensions. 

We distinguish "sampling limited" and "sensitivity limited" data collection regimes, depending on whether 
the sampling of the indirect dimensions or the sensitivity of the multidimensional NMR experiments per se 
determines the minimally achievable measurement time. Because structure determinations rely on nearly 
complete shift assignments routinely obtained using ^C^N^H-triple-resonance (TR) NMR (3), the 
development of techniques that avoid the sampling limited regime represents an important challenge. 
Reduced-dimensionality (RD) TR NMR experiments (4-7), designed for simultaneous frequency labeling 
of two spin types in a single indirect dimension, offer a viable strategy to circumvent sampling-limited 
recording of NMR spectra. RD NMR is based on a projection technique for reducing the spectral 
dimensionality: the chemical shifts of the projected dimension give rise to a cosine-modulation of the 
transfer amplitude, yielding peak doublets encoding n chemical shifts in an n-l dimensional spectrum. 
Thus, for example, four-dimensional (4D) information can be obtained in a 3D experiment. This reduces 
the sampling requirements and the minimal measurement time by about an order of magnitude (7), which 
allows recording projected 4D experiments within a few hours while retaining maximal evolution times 
and thus a resolution routinely achieved in conventional 3D NMR spectra. Furthermore, axial coherences, 
arising from either incomplete polarization transfer or steady-state heteronuclear magnetization, can be 
observed as peaks located at the center of the doublets (6). This allows both the unambiguous assignment 
of multiple doublets with degenerate chemical shifts in the other dimensions and the identification of cross 
peak pairs by symmetrization of spectral strips about the position of the central peak. RD NMR 
experiments were the first designed to simultaneously recruit both ] H and heteronuclear magnetization 
for signal detection (6), and RD two-spin coherence NMR spectroscopy (8) serves as a valuable 
radio-frequency (rf) pulse module for measurement of cross-correlated heteronuclear relaxation rates (9). 
Here we present a suite of nine RD TR NMR experiments (six of which are unique implementations) for 
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complete protein resonance assignment. To integrate the RD NMR technology for rapid assignment, we 
have, as a step toward fully automated analysis, extended the program AUTOASSIGN (10, H.) for the use 
ofRD NMR spectra. 



► LMaterials^and Methods 
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NMR measurements were performed at 25°C on a Varian Inova 

600 spectrometer by using a 1-mM solution of 13 C/ 15 N-labeled "Z-domain» of || - *™^2 d Methods 
the 71 -residue Staphylococcal protein A (12) in 90% H 2 O/10% D 2 0 (20 mM j ^ Results and Discussion 
K-P0 4 ; pH, 6.5). The protein's overall rotational correlation time obtained Jj * Reference 
from polypeptide backbone 15 N T X JT X ratios (13) is 4.5 ns-i.e., within the 
range encountered for proteins from about 5 to 15 kDa. 

Nine RD NMR experiments were used in conjunction with 3D HNNCACB (M, 14-17) . Fig. 1 surveys 
their names and the correlated chemical shifts. In the nomenclature of the RD NMR experiments 
underlined letters indicate chemical shifts obtained in a common dimension. Fig. 2 displays peak patterns 
observed in the projected dimensions, and Fig. 6 (which is published as supporting information on the 
PNAS web site, www.pnas.org) _degig^ transfer pathways. Three-dimensional 

]^C^(CO)NH>K^ sequential connectivities. Three-dimensional 

HN NCAHA (7),(jPWCA^and^ /wfroresidue connectivities, and 3D 

HNN (CO,CA ) (5) offers both intraresidue ^-"C* and sequential W-^C* connectivities. 
Three-dimensional HCCH-correlation spectroscopy (COSY) and total correlation spectroscopy 
(TOCSY) provide assignmen ts of aliphatic side chains, w hereas two-dimensional (2D) 
j^S (CGCDj^>nd (t HJQ CSY-relayed HCH-COST >rovide those of the aromatic spins. The NMR 
pulse schemes of the hitherto unpublished RD NMR experiments are provided in Figs. 7-12, which are 
published as supporting information on the PNAS web site. The maximal evolution times, as well as the 
resulting measurement times are given in Table 1. RD NMR experiments in which *H and 13 C are 
simultaneously observed in a projected dimension were acquired with' virtually the same maximal 
evolution times in t^Cfilf) to enable accurate matching of peak patterns (Fig. 2). For HNNCAHA and 
HNN(CO,CA), central peaks were derived from incomplete polarization transfer (6, 7). For others, C 
magnetization present at the end of the refocusing period of the initial polarization transfer from l H to 
13 C was recruited, which yields two subspectra containing the peak pairs and central peaks, respectively 
(Fig. 2). In view of potential peak overlap, proper setting of the radio-frequency (rf) carriers is crucial. In 
HN NCAHA, for example, this allows one to place central peaks, and up-field and down-field component 
of the peak pairs into separate spectral regions (7). When data are collected in this manner, peak overlap 
does not increase when compared with HNNC A. 
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experiments 



The relative sensitivity of NMR experiments was analyzed by determining the yield of peak 
detection—i.e., the ratio of observed peaks over the total number of expected peaks— and by separately 
assessing the S/N ratio distributions of peaks belonging to either peak pairs or central peaks. To rank the 
experiments (Table 1) according to sensitivity, only peaks encoding the prime information of a given 
spectrum were considered— e.g., //rtraresidue connectivities in HN NCAHA and HNNCACB, correlation 
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peaks in HCCH-COSY and relay connectivities in HCCH-TOCSY. The averaged S/N ratios were 
divided by the square root of the measurement time (Table 1) and scaled relative to the most sensitive 
experiment yielding peak pairs— i.e., HACA(CO)NHN. To avoid a bias from longer transverse relaxation 
times in several flexibly disordered terminal residues, the N-terminal octapeptide segment comprising 
residues "-13" to "-6" (in the numbering chosen in ref. 12) was not considered in these sensitivity 
analyses. 

The program AUTOASSIGN (V. 1 .7.3; refs. 10 and H) was extended for analysis of RD TR NMR 
experiments. The input included 3D peak lists derived from 3D HACA(CO)NHN, H^C* 7 *>(CO)NHN and 
HNNCAHA, a 3D HNNCO peak list from 3D HNM CO. CA) and the 3D HNNCACB peak list. Details of 
the computational protocol are available in Supporting Text, which is published as supporting information 
on the PNAS web site. The AUTOASSIGN program, the input RD NMR peak lists, and the output 
resonance assignment lists are available from the authors. 



► Results and Discussion 
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The data collection times and the relative sensitivities of the experiments are 
shown in Table 1. For those providing sequential connectivities—i.e., 3D 
HACA(CO)NHN, if^C^(CO)NHN, and HNN(CO ? CA> (Fig. l)-peak pair 
and central peak detection was complete (100%). Among the experiments 
providing m/roresidue connectivities, both 3D HN NCAHA and 3D 
HNNCACB likewise exhibited 100% yield, whereas a few were missing in H^C^COHA (yield, 98% of 
peak pairs; 91% of central peaks). In part, this is because magnetization is detected on the l W protons 
close to the water resonance. Peak pair detection in 3D HCCH-COSY was nearly complete (90% of peak 
pairs, 93% of central peaks), whereas the yield of relayed COSY peak pairs in 3D HCCH-TOCSY was 
slightly lower (81%). To some extent this was due to signal overlap and not the lack of sensitivity. Nearly 
complete signal detection was also observed in 2D HBCB(CGCD)HD (100%) and ^-TOCSY-relayed 
HCH-COSY (90% of peak pairs). The detailed S/N analysis revealed (/) outstanding sensitivity for 
detection of peak pairs in 3D HACA(CO)NHN, (//) about similar sensitivity for 3D H^C^(CO)NHN, 
HNN CAHA , HN MCO . CA ), HNNCACB (nowadays routinely used up to around 25 kDa), 
HCCH-COSY, and 2D ^-TOCSY-relayed HCH-COSY, and (///) reduced sensitivity for 3D 
H^C^COHA, 2D HBCB(CGCD)HD, and relay peak detection in 3D HCCH-TOCSY. (Note that 
analysis of the spectra for assignment of the aromatic spin systems was somewhat impeded by the small 
number of aromatic residues in Z-domain, as well as their partially flexibly disordered nature.) In 
accordance with the very high sensitivity of HNNCO, central peak detection in HNN(CO,CA> is by far the 
most sensitive and serves for secure spin system identification in cases of overlap in 2D 
[^N^HJ-heteronuclear sequential quantum correlation (HSQC) (5, 10). 

The analysis summarized in Table 1 allows one to devise a strategy for RD NMR-based HTP resonance 
assignment in which 3D H^C^(CO)NHN establishes sequential backbone connectivities and 
connectivities to both the aliphatic and aromatic side chains (Fig. 2). Firstly, the peak patterns along 
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w (13c**/1h**) in subspectra I and II of 3D H^C^(CO)NHN enable sequential resonance assignment in 
combination with HN NCAHA (Fig. 3) and HNNCACB, respectively. Complementary recording of 3D 
H^C^COHA and HNN (C(XCA ) contributes polypeptide backbone 13 C chemical shift measurements for 
establishing sequential assignments: the intraresidue 13 C correlation is obtained by t^^C^H*^ peak 
pattern matching of H^C^(CO)NHN with H^C^COHA, and the sequential 13 C correlation is inferred 
from (13)0*, 15 N, and ! H N chemical shifts in HNN(CO,CA) (see Fig. 7). Secondly, comparison of wj( 13 C 
-^H**) peak patterns with 3D HCCH-COSY and -TOCSY connects the C 3 /H^ chemical shifts with 
those of the more peripheral aliphatic side chain spins (Fig. 4), whereas comparison of (^(^cVh 0 ) peaks 

with 2D HBCB (CGCD)HD and subsequent linking with l H s chemical shifts detected in 2D 
^-TOCSY-relayed HCH-COSY affords assignment of the aromatic spin systems (Fig. 5). For many 
residues the two P-proton chemical shifts are nondegenerate, and the connection of H^C^(CO)NHN 
with HBCB (CGCD)HD or HCCH-COSY/TOCSY may then rely on comparison of the three chemical 
shifts of 1 H f * 2 , W 3 , and 13 C*\ In general, the identification of peaks pairs is complicated when chemical 
shift degeneracy in the other dimension occurs, but central peak acquisition (Figs. 2 and 3) addresses this 
complication in a straightforward fashion (6). Importantly, however, pairs of peaks generated by a 
chemical shift in-phase splitting have quite similar intensity. Usually this does not hold for two arbitrarily 
selected peaks, because the nuclear spin relaxation times vary within each residue as well as along the 
polypeptide chain. The peak pairs are thus "labeled" with the relaxation times, making the pair 
identification obvious in most cases (Figs. 3^5). This is also advantageous for automated peak picking. 
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The suite of the nine experiments outlined in Fig. 1 complemented by 3D HACA(CO)NHN forms a 
"standard set" for RD NMR-based protein resonance assignment. For Z-domain, the entire set was 
recorded within only 65 h by using a conventional TR NMR probe at 600 MHz (Table I). The high 
redundancy of these seven projected 4D, one conventional 3D, and two projected 3D spectra provides a 
very efficient resonance assignment strategy, which also profits from the fact that the detection of 
symmetric RD NMR peak pairs greatly facilitates the identification of peaks close to the noise level. 
Importantly, the information provided by projected 4D spectra cannot be obtained by recording twice the 
number of 3D spectra: in cases of chemical shift degeneracy a chemical shift quartuple is not equivalent to 
two shift triples. 

It is of significant practical advantage that the sensitivity within the standard set varies only by about a 
factor of two (Table I). This facilitates the prediction of minimal required measurement times (roughly a 
multiple of the measurement time of a single experiment). In fact, the S/N ratios observed in the first 
experiment allow one to adjust measurement times while data acquisition is in progress. For Z-domain, 
six RD NMR experiments were actually sufficient to provide complete backbone and side chain resonance 
assignments (3D H^C^(CO)NHN, HNNCAHA, HCCH-COSY/TOCSY, 2D HBCB(CGCD)HD, and 
^-TOCSY-relayed HCH-COSY recorded in 36 h; Table 1), and those can be considered as a "minimal 
set" for proteins up to around 10 kDa. As expected, chemical shifts agreed well with those previously 
obtained at 30°C by using conventional TR NMR (12). 

For proteins above &15 kDa, recording of highly sensitive 3D HACA(CO)NHN promises (/') to yield spin 
systems that escape detection in H^C*^(CO)NHN, and (//) to offer the efficient distinction of a- and 
P-moiety resonances by comparison with H^C^(CO)NHN. Moreover, Nietlispach et al (18) have 
recorded 4D H 3t/i *C t ^(CO)NHN for 50% random fractionally deuterated proteins reorienting with 
correlation times up to around 20 ns (corresponding to ^20-30 kDa at ambient temperature). Thus, the 
3D H*^C^(CO)NHN, or a transverse relaxation optimized version thereof (19), may well maintain the 
role outlined in Fig. 2 for assigning partially deuterated proteins at least up to about that size, in particular 
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when using cryogenic probes ( 20) . 



Cryogenic probes reduce measurement times by about a factor of 10 or more (21). In the 
sensitivity-limited regime, where sampling requirements do not provide a bound for the minimal 
measurement time, the required signal could have been recorded for the standard set often experiments in 
about 6.5 h (Table I). Using the current implementations (Table 1), the standard set could have been 
recorded with a single transient per increment and without central peak detection, considering the "spin 
relaxation time labeling" of peak pairs. This would then reduce the total measurement time to about 18 h. 
Hence, RD NMR promises to allow a rather close adjustment of measurement times to sensitivity 
requirements. 

HTP employment of RD NMR requires strong computer support for data analysis. For the backbone and 
l3 C* resonances, this issue was addressed by extending the program AUTOASSIGN (10, U) to analyze 3D 
HACA(CO)NHN, H^C^(CO)NHN, HNNCAHA, HNN(CO,CA>, and HNNCACB (recorded in 33 h; 
Table 1). AUTOASSIGN determined 94.5% (345 of 366) of all backbone ! H N , 15 N, 13 C = O, 13 C, 13 C*, 
and 1 H* chemical shifts with an error rate of only 0.9%, and obtained at least three chemical shifts for 
61 of 63 residues (97%). Evidently, the good spectral resolution of the RD spectra (Table I) greatly 
supported the efficient automated analysis. Importantly, the chemical shifts rapidly obtained from 
AUTOASSIGN directly revealed the protein's secondary structure (22): the helical boundaries derived 
independently from this chemical shift analysis and the NMR solution structure (12) are virtually identical. 
The extension of AUTOASSIGN for assigning side chain chemical shifts is currently in progress. This will 
allow one to automatically obtain nearly complete resonance assignments of proteins. 
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prediction (23), protein target selection, and construct optimization in B==^ 
structural genomics. Considering that (/) sensitivity and sweep widths increase with increasing magnetic 
fields and that (//) the widespread use of cryogenic probes will greatly boost the sensitivity of our 
spectrometers, we expect a "change in paradigm" in biological NMR spectroscopy with a new focus on 
research addressing the caveat of sampling limitation. Data processing protocols reducing the number of 
data points in the indirect dimensions without sacrificing spectral resolution, such as linear prediction and 
maximum entropy methods (24, 25), nonlinear sampling (25, 26), and possibly the recently introduced 
filter diagonalization method (27, 28), appear to be of keen interest to further enhance the impact of RD 
NMR. 



RD NMR is a powerful approach to avoid the "sampling limited acquisition 
regime." This is of outstanding interest in view of the forthcoming era of 
cryogenic probes. In particular, the resulting rapid determination of protein 
secondary structure from chemical shifts (22) will greatly support fold 
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